58 research outputs found

Must . . . stay . . . strong!

Author: Angelika Kratzer
Anna Papafragou
Anthony S. Gillies
Aviad Heifetz
Chris Barker
Christopher Potts
David Lewis
Eric McCready
Eric Swanson
Ferdinand de Haan
Immanuel Kant
Jeroen A.G. Groenendijk
Johan Rooryck
John Lyons
Kai von Fintel
Keith DeRose
Lisa Matthewson
Eleanor M. Blain
Sven O. Hansson
Paul Grice
Paul Portner
Ronald Fagin
Thomas Willett
Tom Werner
Alexandra Y. Aikhenvald
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010

This is the fourth installment in our trilogy of papers on epistemic modality. It is a recurring mantra that epistemic must creates a statement that is weaker than the corresponding flat-footed assertion: It must be raining vs. It’s raining. Contrary to classic discussions of the phenomenon, such as those by Karttunen, Kratzer, and Veltman, we argue that instead of having a weak semantics, must presupposes the presence of an indirect inference or deduction rather than of a direct observation. This is independent of the strength of the claim being made. Epistemic must is therefore quite similar to evidential markers of indirect evidence known from languages with rich evidential systems. We work towards a formalization of the evidential component, relying on a structured model of information states (analogous to some models used in the belief dynamics literature). We explain why, in many contexts, one can perceive a lack of confidence on the part of the speaker who uses must.

CiteSeerX

DSpace@MIT

Crossref

An experimental study of the intrinsic stability of random forest variable importance measures

Author: A Altmann
A Kalousis
A Statnikov
A Verikas
AC Haury
AL Boulesteix
CH Park
D Ma
DM Reif
DS Cao
EC Fieller
Fan Yang
H Wang
Huazhen Wang
I Guyon
I Kamkar
J Paul
JM Cadenas
KK Nicodemus
L Breiman
L Hamers
L Yu
LI Kuncheva
MB Kursa
ML Calle
O Okun
R Díaz-Uriarte
R Fagin
R Genuer
S Alelyani
S Loscalzo
S Pleus
SS Lee
SY Kim
TK Ho
VY Kulkarni
Y Han
Y Zhang
Z He
Zhiyuan Luo
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016

BACKGROUND: The stability of Variable Importance Measures (VIMs) based on random forest has recently received increased attention. Despite extensive attention to traditional stability under data perturbations or parameter variations, few studies consider the influence of the intrinsic randomness in generating VIMs, i.e., bagging, randomization and permutation. To address this influence, in this paper we introduce a new concept of intrinsic stability of VIMs, defined as the self-consistency among feature rankings in repeated runs of VIMs without data perturbations or parameter variations. Two widely used VIMs, Mean Decrease Accuracy (MDA) and Mean Decrease Gini (MDG), are comprehensively investigated. The motivation of this study is two-fold. First, we empirically verify the prevalence of intrinsic stability of VIMs over many real-world datasets to highlight that the instability of VIMs does not originate exclusively from data perturbations or parameter variations, but also stems from the intrinsic randomness of VIMs. Second, through Spearman and Pearson tests we comprehensively investigate how different factors influence the intrinsic stability.
RESULTS: The experiments are carried out on 19 benchmark datasets with diverse characteristics, including 10 high-dimensional and small-sample gene expression datasets. Experimental results demonstrate the prevalence of intrinsic stability of VIMs. Spearman and Pearson tests on the correlations between intrinsic stability and different factors show that #feature (number of features) and #sample (size of sample) have a coupling effect on the intrinsic stability. The synthetic indicator, #feature/#sample, shows both negative monotonic correlation and negative linear correlation with the intrinsic stability, while OOB accuracy has monotonic correlations with intrinsic stability. This indicates that high-dimensional, small-sample and high-complexity datasets may suffer more from intrinsic instability of VIMs. Furthermore, with respect to parameter settings of random forest, a large number of trees is preferred. No significant correlations can be seen between intrinsic stability and other factors. Finally, the magnitude of intrinsic stability is always smaller than that of traditional stability.
CONCLUSION: First, the prevalence of intrinsic stability of VIMs demonstrates that the instability of VIMs not only comes from data perturbations or parameter variations, but also stems from the intrinsic randomness of VIMs. This finding gives a better understanding of VIM stability, and may help reduce the instability of VIMs. Second, by investigating the potential factors of intrinsic stability, users would be more aware of the risks and hence more careful when using VIMs, especially on high-dimensional, small-sample and high-complexity datasets.
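The idea of self-consistency among feature rankings across repeated runs can be illustrated with a minimal sketch. It assumes, as one plausible choice rather than the paper's own definition, that intrinsic stability is scored as the mean pairwise Spearman correlation between importance vectors obtained from repeated runs on the same data with the same parameters. The function names and the importance scores below are hypothetical.

```python
from itertools import combinations
from statistics import mean


def ranks(scores):
    """Return 1-based ranks (1 = smallest score); tied scores share the average rank."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    result = [0.0] * len(scores)
    i = 0
    while i < len(order):
        j = i
        # Extend j over a block of tied scores.
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = avg_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation; raises ZeroDivisionError if either input is constant."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman(x, y):
    """Spearman correlation, computed as the Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def intrinsic_stability(runs):
    """Mean pairwise Spearman correlation over repeated runs (assumed score, not the paper's)."""
    return mean(spearman(a, b) for a, b in combinations(runs, 2))


# Hypothetical importance scores for 4 features from 3 repeated runs
# on identical data and identical parameters.
runs = [
    [0.40, 0.30, 0.20, 0.10],
    [0.38, 0.33, 0.12, 0.17],
    [0.41, 0.29, 0.21, 0.09],
]
print(round(intrinsic_stability(runs), 3))
```

With these made-up scores, the second run swaps the two least important features, so the score falls below 1 (about 0.87). In practice the importance vectors would come from refitting a random forest several times with different random seeds.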

Crossref

Springer - Publisher Connector

Royal Holloway - Pure

PubMed Central

The Rationale of PROV

Author: Baker
Barga
Bearman
Bose
Buneman
Cheney
Cheney
Ciccarese
Ciccarese
Cohen-Boulakia
Cui
Davidson
Davies
da Silva
Fagin
Freire
Groth
Hartig
Horrocks
James Cheney
Kwasnikowska
Luc Moreau
Miles
Miles
Minsky
Moreau
Moreau
Moreau
Moreau
Paul Groth
Schmachtenberg
Shaon
Simmhan
Simmhan
Simmhan
Simon Miles
Timothy Lebo
van der Meyden
Wooldridge
Zhao
Zhao
Publication venue: 'Elsevier BV'
Publication date: 01/01/2015

The PROV family of documents is the final output of the World Wide Web Consortium Provenance Working Group, chartered to specify a representation of provenance to facilitate its exchange over the Web. This article reflects upon the key requirements, guiding principles, and design decisions that influenced the PROV family of documents. A broad range of requirements was found, relating to the key concepts necessary for describing provenance, such as resources, activities, agents and events, and to balancing PROV’s ease of use with the facility to check its validity. By this retrospective requirement analysis, the article aims to provide some insights into how PROV turned out as it did and why. Benefits of this insight include better inter-operability, a roadmap for alternate investigations and improvements, and solid foundations for future standardization activities.

Southampton (e-Prints Soton)

Elsevier - Publisher Connector

Crossref

Edinburgh Research Explorer

King's Research Portal

Recommended from our members

Cancer therapy shapes the fitness landscape of clonal hematopoiesis.

Author: Arcila Maria E
Bajorin Dean
Ball Markus
Baselga Jose
Benayed Ryma
Berger Michael F
Bernard Elsa
Berthon Antonin
Bolton Kelly L
Boucai Laura
Braunstein Lior
Caltabellotta Nicole M
Chatterjee Nilanjan
Coombs Catherine C
Devlin Sean M
Diaz Luis A
Druley Todd
Ebert Benjamin L
Fagin James
Farnoud Noushin
Gao Teng
Garcia-Closas Montserrat
Gardos Stuart
Gibson Christopher J
Gillis Nancy
Glodzik Dominik
Gundem Gunes
Hyman David M
Kelly Daniel
Klimek Virginia M
Ladanyi Marc
Lee Choonsik
Levine Max
Levine Ross L
Li Sonya
Mandelker Diana
Mantha Simon
Martinez Juan S Medina
Morton Lindsay M
Norton Larry
Offit Kenneth
Ossa Juan E Arango
Padron Eric
Papaemmanuil Elli
Paraiso Eder
Patel Akshar
Patel Minal
Pharoah Paul
Philip John
Ptashkin Ryan N
Robson Mark E
Scher Howard
Schulman Jessica
Solit David B
Spitzer Barbara
Stadler Zsofia
Stopsack Konrad H
Syed Aijazuddin
Takahashi Koichi
Tallman Martin
Walsh Mike
Yabe Mariko
Young Andrew L
Zehir Ahmet
Publication venue: 'Organisation for Economic Co-Operation and Development (OECD)'
Publication date: 01/11/2020

Acquired mutations are pervasive across normal tissues. However, understanding of the processes that drive transformation of certain clones to cancer is limited. Here we study this phenomenon in the context of clonal hematopoiesis (CH) and the development of therapy-related myeloid neoplasms (tMNs). We find that mutations are selected differentially based on exposures. Mutations in ASXL1 are enriched in current or former smokers, whereas cancer therapy with radiation, platinum and topoisomerase II inhibitors preferentially selects for mutations in DNA damage response genes (TP53, PPM1D, CHEK2). Sequential sampling provides definitive evidence that DNA damage response clones outcompete other clones when exposed to certain therapies. Among cases in which CH was previously detected, the CH mutation was present at tMN diagnosis. We identify the molecular characteristics of CH that increase risk of tMN. The increasing implementation of clinical sequencing at diagnosis provides an opportunity to identify patients at risk of tMN for prevention strategies.

Apollo (Cambridge)

Following the development of knowledge economies, there has been a rapid expansion of economic analysis of knowledge, in the context of both technological knowledge in particular and decision theory in general. This paper surveys this literature by identifying the main themes and contributions, and outlines the future prospects of the discipline. The wide scope of knowledge-related questions, in terms of applicability and alternative approaches, has led to the fragmentation of research. Nevertheless, one can identify a continuing tradition which analyses various aspects of the generation, dissemination and use of knowledge in the economy.

Crossref

Online Research @ Cardiff